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Abstract —In this paper we study the data exchange problem 
where a set of users is interested in gaining access to a 
common hie, but where each has only partial knowledge about 
it as side-information. Assuming that the file is broken into 
packets, the side-information considered is in the form of 
linear combinations of the file packets. Given that the collective 
information of all the users is sufficient to allow recovery of 
the entire file, the goal is for each user to gain access to the 
file while minimizing some communication cost. We assume 
that users can communicate over a noiseless broadcast channel, 
and that the communication cost is a sum of each user’s cost 
function over the number of bits it transmits. For instance, the 
communication cost could simply be the total number of bits 
that needs to be transmitted. In the most general case studied 
in this paper, each user can have any arbitrary convex cost 
function. We provide deterministic, polynomial-time algorithms 
(in the number of users and packets) which find an optimal 
communication scheme that minimizes the communication cost. 
To further lower the complexity, we also propose a simple 
randomized algorithm inspired by our deterministic algorithm 
which is based on a random linear network coding scheme. 
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I. Introduction 

In recent years cellular systems have witnessed signifi¬ 
cant improvements in terms of data rates, and are nearly 
approaching the theoretical limits in terms of the physical 
layer spectral efficiency. At the same time, the rapid growth in 
the popularity of data-enabled mobile devices, such as smart 
phones and tablets, and the resulting explosion in demand 
for more throughput are challenging our abilities to deliver 
data, even with the current highly efficient cellular systems. 
One of the major bottlenecks in scaling the throughput with 
the increasing number of mobile devices is the “last mile” 
wireless link between the base station and the mobile devices 
- a resource that is shared among many users served within 
the cell. This motivates the study of paradigms where cell 
phone devices can cooperate among themselves to get the 
desired data in a peer-to-peer fashion without solely relying 
on the base station. 

The material in this paper appears in part in m (a 
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Fig. 1. An example of the data exchange problem. A base station has a file 
formed of six packets wi,, wq € Fg and wants to deliver it to three users 
over an unreliable wireless channel. The base station stops transmitting once 
all users collectively have all the packets, even if individually they have only 
subsets of the packets (Stage 1). Users can then cooperate among themselves 
to recover their missing packets by broadcasting over a noiseless public 
channel (Stage 2). It can be shown that the minimum number of symbols in 
¥q needed for the file recovery at all users is 5. A communication scheme 
that achieves this minimum is: user 1 transmits wi, user 2 transmits W 2 -\-'W 4 , 
while user 3 transmits w^, w^, wq. Now, if the goal is to allocate these 
5 transmissions to the users as uniformly as possible, user 1 transmits w\, 
user 2 transmits W 2 + iU 4 , ws, and user 3 transmits 103 , wq. 

An example of such a setting is shown in Figure [T] where 
a base station wants to deliver the same file to multiple 
geographic ally-close users over an unreliable wireless down¬ 
link. In the example of Figure [T] we assume that the file 
consists of six equally sized packets wi, W2, W3, W4, W5 
and Wq belonging to some finite field Fg. Suppose that after 
a few initial transmission attempts by the base station, the 
three users individually receive only parts of the file (see 
Figure [T]), but collectively have the entire file. Now, if all 
users are in close vicinity and can communicate with each 
other, then, it is much more desirable and efficient, in terms 
of resource usage, to reconcile the file among users by letting 
all of them “talk” to each other without involving the base 
station. The cooperation among the users has the following 
advantages: 

■ Local communication among users has a smaller foot- 













print in terms of interference, thus allowing one to use 
the shared resources (code, time or frequency) freely 
without penalizing the base station’s resources, i.e., 
higher resource reuse factor. 

• Transmissions within the close group of users is much 
more reliable than from the base station to any terminal 
due to geographical proximity of terminals. 

« This cooperation allows hie recovery even when the 
connection to the base station is either unavailable after 
the initial phase of transmission, or it is too weak to 
meet the delay requirement. 

Let us consider the example in Figure [T] and let user 1, 
user 2 and user 3 transmit and i ?3 symbols in F^, 

respectively. It can be shown that the minimum total number 
of symbols in needed to recover the hie is 5. One possible 
communication scheme that achieves it is: user 1 transmits 
wi, user 2 transmits W 2 +W 4 , while user 3 transmits ws, W 5 , 
wq. Note that the load of the communication of the system 
is unevenly distributed among the users, i.e., user 3 transmits 
3 out of 5 symbols in Fg. The next question we ask here is 
out of all communication schemes that deliver the entire hie 
to the users in the minimum number of transmissions, which 
one distributes the load of communication to the users as 
fair as possible. For instance, for the same minimum number 
of transmissions, we can have the following scheme: user 1 
transmits wi, user 2 transmits W 2 + W 4 , W 5 , and user 3 
transmits W 3 , wq. Intuitively, this scheme is more fai£] than 
the previous one since it spreads the transmissions more 
uniformly among the users. And, it can be shown that such 
scheme minimizes a convex fairness cost. 

In the example from Figure [T] we considered only a 
simple form of side-information, where different users ob¬ 
serve subset of uncoded “raw” packets of the original hie. 
Content distribution networks a-ia are increasingly using 
codes, such as linear network codes or Fountain codes 11, 
to improve the system efficiency. In such scenarios, the side- 
information representing the partial knowledge gained by the 
users would be coded and in the form of linear combinations 
of the original hie packets, rather than the raw packets 
themselves. We refer to this model of side-information as 
a linear packet model. 

Contributions 

In this paper, we study the data exchange problem under 
the linear packet model and the separable convex communi¬ 
cation cost. Such cost captures all the communication objec¬ 
tives discussed earlier: 1. Minimization of the (weighted) sum 
of bits users need to exchange, 2. Fairness. Our contributions 
can be summarized as follows: 

1) We propose a deterministic polynomial time algorithm 
for hnding an optimal communication scheme w.r.t. the 
communication cost. An important step of this algorithm 
is to iteratively determine how much should each user 

*To be precise, the fairness cost that we consider belongs to the broader 
class of separable convex costs that is studied in this work. 


transmit in an optimal scheme. We provide two methods 
to solve this problem. The hrst one is based on mini¬ 
mizing a sub modular function, in which case the total 
complexity of the algorithm is 0{{mP-N^+m7)-logN), 
where m is the total number of users, and N is the 
number of packets in the hie. The second technique 
is based on subgradient methods, in which case the 
total complexity of the algorithm can be bounded by 
log m + • log N) given that we use 

constant step size in the subgradient algorithm. 

2) We devise a randomized algorithm inspired by 
the deterministic scheme that reduces complexity to 
0{m ■ N'^ log N). The randomized algorithm is based 
on a random linear network coding scheme, and it 
achieves the optimal number of transmissions with high 
probability. To be more precise, the probability of not 
achieving the optimum is inversely proportional to the 
underlying held size |Fg|. Our randomized algorithm 
can be regarded as a generalization of the algorithm 
proposed in [Sl, where the authors considered linear 
communication cost. 

3) For the data exchange problem with additional capacity 
constraints on each user, we provide both deterministic 
and randomized algorithm of the same complexity as in 
1. and 2. 

The challenging part of the deterministic algorithm is that the 
underlying optimization problem has exponential number of 
constraints coming from the cut-set bound region. By us¬ 
ing combinatorial optimization techniques such as Dilworth 
truncation and Edmonds’ algorithm, we devise an efficient, 
polynomial time solution. 

Literature Overview 

The problem of reconciling a hie among multiple wireless 
users having parts of it while minimizing the cost in terms 
of the total number of bits exchanged is known in the 
literature as the data exchange problem and was introduced 
by El Rouayheb et al. in US). A closely related problem 
was also studied by Csiszar and Narayan in ITOl where 
all users want to agree on a secret key in the presence of 
an eavesdropper who observes the entire communication. A 
randomized algorithm for the data exchange problem was 
proposed in im, while Tajbakhsh et al. mi formulated this 
problem as a linear program (LP). The solution proposed 
in mi is approximate. 

The linear cost data exchange problem was studied by 
Ozgul et al. H, where the authors proposed a randomized 
algorithm. A deterministic polynomial time algorithm was 
proposed by Courtade and Wesel in mi concurrently to 
the authors’ work ||2l. Minimum linear communication cost 
problem was also studied in the network coding literature. 
Lun et al. M proposed a polynomial time algorithm for 
the single source multicast problem over a directed acyclic 
graph. 

In mi, na, the authors considered a different version of 
the data exchange problem where users can only broadcast 


messages to their immediate neighbors. In ifTSl it was shown 
that the problem is NP-hard, while an approximate solution 
is provided in Qa. In ini, Lucani et al. considered the 
problem of data exchange when the channel between different 
users can have erasures. 

In QOl, the authors posed a related security problem 
referred to as the “multi-terminal key agreement” problem. 
They showed that obtaining the file among the users in 
minimum number of bits exchanged over the public channel 
is sufficient to maximize the size of the secret key shared 
between the users. This result establishes a connection be¬ 
tween the Multi-party key agreement and the data exchange 
problem. 

The rest of the paper is organized as follows. In Section HU 
we describe the model and formulate the optimization prob¬ 
lem. In Section |III] we provide a polynomial time algorithm 
that solves for how many symbols in F, should each user 
transmit. We start Section Hlllbv analyzing a linear cost func¬ 
tion, and then we extend our solution to any separable convex 
cost. In Section HV] we propose a polynomial time code 
construction. In Section |V] we describe an algorithm based 
on random linear network coding approach, that achieves the 
optimal communication cost. In Section |Vl] we present a 
polynomial time solution to the problem where each user 
additionally has capacity constraints, i.e., user i is not allowed 
to transmit more than Ci symbols in F^. We conclude our 
work in Section IVIII 

II. System Model AND Preliminaries 

In this paper, we consider a setup with m users that are 
interested in gaining access to a file. The file is broken into 
N linearly independent packets wi,... ,wn each belonging 
to a field F^, where g is a power of some prime number. Each 
user i € A4 = {1,2,..., m} observes some collection of the 
linear combinations of the file packets as shown below. 


Definition 1. A rate vector R = {Ri,R 2 ,... ,Rm) is 
an achievable data exchange (DE) rate vector if there 
exists a communication scheme with transmitted messages 
(vi, V 2 ,..., Vm) that satisfies (HI for all z = 1,..., m. 

Remark 1. Using cut-set bounds, it follows that all the 
achievable DE-rate vectors necessarily belong to the follow¬ 
ing region 

7^ = {R G R"* : R[S) >N- rank(A^.,\5), V5 C 7W} , 

(5) 

where 

and 4 \J A,. 

ie5 ieM\S 

Theorem 1. Fora sufficiently large field size |Fg|, any integer 
DE-rafe vector R G Z™ that belongs to the cutset region TZ, 
can be achieved via linear network coding, i.e., it is sufficient 
for each user i G M to transmit Ri properly chosen linear 
combinations of the data packets it observes. 

Proof of Theorem [T] is provided in Appendix In Sec¬ 
tion HV] we show that any field size |Fq| larger than the 
number of users is sufficient to guarantee the existence of 
such solution. In general, finding the minimum field size can 
be a hard problem. 

In order for each user to recover the entire file, it is nec¬ 
essary to receive a sufficient number of linear combinations 
of the other users’ observations. Hence, v^, i G M, defined 
above is a vector of Ri symbols in Fg. Therefore, can be 
written as follows 

Vi = BjXi = BjAjW = UiW, (6) 

where Bi is an Ri x £i transmission matrix with elements 
belonging to Fg. In order for each user to recover the file, 
the transmission matrices Bi, z G Ad should satisfy. 


Xi = AiW, i G Ad, (1) 

where Ai G is a given matrix, and w = 

[ zui W 2 ... zuat is a vector of the file packets. In 
the further text, we refer to O as a linear packet model. 

Let us denote by Vi, a transmission of user z G Ad. In ifTOl 
it was shown that in order for each user to recover the file, 
interaction among them is not needed. Hence, without loss 
of generality, we can assume that Vi is a function of user z’s 
initial observation. We define 

R^ = |v.|, (2) 

to be the size of user z’s transmission represented in number 
of symbols in F,. To decode the file, user z collects trans¬ 
missions of all the users and creates a decoding function 

z/-, :F^; xFf X ••• xFf-^F^, (3) 

that reconstructs the file, i.e., 

(4) 


rank 


A, 

U 


= A, Vi G Ad, 


(7) 


where U = Hence, the decoding function ipi of 

user i € A4 involves inverting the matrix given in (|7]l in 
order to obtain w. 

In this work, we design a polynomial complexity scheme 
that achieves the file exchange among all the users while 
simultaneously minimizing a convex separable cost function 
Sfci where z G Ad is a non-decreasing convex 

function. Such assumption on monotonicity of function ipi is 
consistent with the nature of the problem at hand; sending 
more bits is always more expensive than sending fewer. 
From (|5]l and the above mentioned cost function, the problem 
considered in this work can be formulated as the following 
optimization problem; 

m 

min (8) 

i=l 

s.t. i?(<S) > N — rank(AyK\ 5 ), V5 C Ad. 


z/ii(Xi, Vi, . . . , Vm) = w. 




Optimization problem (|8]l is a convex integer problem with 
2”^ — 2 constraints. It was shown in lITSlI that only n of these 
constraints are active but the challenge is how to determine 
which of them are. Solving the optimization problem ([8]l 
answers the question of how many symbols in Fg each user 
has to transmit in an optimal scheme. In this paper we provide 
a polynomial time algorithm that solves problem ®. Once 
we obtain an optimal rate allocation, the actual transmissions 
of each user can be solved in polynomial time by using 
the algebraic network coding framework im, lEa. This is 
explained in Section |IV] 

III. Deterministic Algorithm 

Our goal is to solve problem ® efficiently. To do so, we 
will split it into two subproblems: 

1) Given a total budget constraint /?, i.e., R{Ai) = Ri + 
i ?2 + ■ • ■ Rm = (3, determine whether /3 is feasible or 
not. If P is feasible, find the feasible rate split among the 
users that will achieve the total budget /3 and minimize 
the cost Y.IL 1 ^i{Ri)- 

2) Find /3 that minimizes the objective function. 

The bottleneck here is how to solve Problem 1 efficiently. The 
optimal value of /3 can then be found using binary search (see 
Algorithm |3ll since the objective function is w.r.t. /3. First, let 


us identify these two problems by rewriting problem (|8]l as 

follows 

min h(B), 

/3ez+ 

(9) 

where 



h(p) = min 
RgZ'" 

777 

(10) 




s.t. R{M) = p, R{S) > N — rank(A2v^\s), V<S C M. 

Note that the optimizations (|9]l and (fTOl i are associated with 
Problem 2 and Problem 1 defined above, respectively. Next 
we will explain our approach to solving these two problems. 

A. Optimization with a given sum-rate budget P 

Now, let us focus on the set of constraints of optimization 
problem (fTol i. By substituting S with AA\S, we obtain 

R{M) = P, 

R(M \S) = R{M) - R{S) = p- R{S) 

> A-rankjAs), V5 c Ad, 5 ^ 0. (11) 

Therefore, optimization problem (fTOl i can be equivalently 
represented as follows 

771 

h(P) = min .E ¥>^{R^), (12) 

i=l 

s.t. R{M) = p, 

R{S) <P-N + rank(A5), VS C M, S ^ 0. 

Before we go any further, let us introduce some concepts 
from combinatorial optimization theory. 


Definition 2 (Polyhedron). Let be a set function defined 
over set Ad = {1, 2,..., mj, i.e., fp : 2^ Z, where 2^ 
is the power set of Ad. Then the polyhedron P{fp) and the 
base polyhedron B{fis) of are defined as follows. 

P{f^) 4 {R e I R{S) < ffiiS), vs C Ad}, (13) 
Bifp)^{KGPifp)\RiM) = UM)}. (14) 

Note that the set of constraints of problem (fT2l i. for any 

fixed P G Z+, constitutes the base polyhedron B{fp) of the 
set function 

{ P — N -\- rank(A5) if 5 C Ad, 5^0 

P if 5 = Ad, (15) 

0 if 5 = 0. 

Example 1. Let us consider the source model from Figure [T] 
where the three users observe the following parts of the file 


W2 W3 

W4 

Wb 

We 

P 


XI = [ 

Wi 

W2 

r. 



X 2 = [ 

W2 

W4 

We 

We , 


X 3 = [ 

W3 

W4 

We 

We ]'^. 

(16) 


For /3 = 4, the base polyhedron P{fi) is defined by the 
following set of inequalities: 

idl </4({l}) = 0, i?2 </4({2}) = 2, i?3 </4({3}) = 2 
Ri+R2 < /4({1, 2}) = 3, i?i + i?3 < /4({1, 3}) = 4, 
i?2 + id3</4({2,3}) = 3, 

i?i+R 2 + i ?3 </4({1,2,3}) = 4. (17) 

It can be verified that no rate vector {Ri, R 2 , R 3 ) G -P(/ 4 ) 
exists such that i?i+i? 2 + 7 d 3 = 4. Therefore, B^f^) = 0. On 
the other hand, for /3 = 5, the polyhedron P{fP) is defined 
as follows 

< /5({l}) = 1, i?2</5({2}) = 3, id3 < /5({3}) = 3 
RiPR2< /5({1, 2}) =4, i?i + i?3 < /5({1, 3}) = 5, 
i?2 + i?3</5({2,3})=4, 

i?i+R 2 + i ?3 </5({1,2,3}) = 5. (18) 

It can be easily verified that the rate vector i?i = 1, 
i ?2 = 3, i ?3 = 1 belongs to the polyhedron /’(/s). Therefore, 

B{h) ^ 0 . 

Summarizing the discussion so far, the optimization prob¬ 
lem (IT 2 I 1 is equivalent to 

771 

min .E pPR,), s.t. ReB(/^), (19) 

where fp is defined in (fTsT l. For now, let us assume that pa¬ 
rameter P is chosen such that the optimization problem (fT9] l 
is feasible, i.e., B{fp) 7 ^ 0. We will explain later how the 
condition B{fp) ^ 0 can be efficiently verified. 

The main idea behind solving the optimization problem 
in (fT9l l efficiently, is to utilize the combinatorial properties 
of the set function fp. 


Definition 3. We say that a set function / : 2^ —Z is 
intersecting submodular if 

f{s) + f{r)>f{suT) + f{snr), 

WS.TCM s.t. 5nr^0. (20) 

When the inequality conditions in (l20l l are satisfied for all 
sets S,T C Ai, the function / is fully submodular. 

Lemma 1. The function fp is intersecting submodular for 
any j3. When (d > N, fp is fully submodular. 

Proof of Lemma [T] is provided in Appendix O 

Theorem 2 (Dilworth Truncation ||2T1 '). For every intersect¬ 
ing submodular function fp there exists a fully submodular 
function gp such that both functions have the same polyhe¬ 
dron, i.e., P{gp) = Pifp), and gp can be expressed as 

gp{S) = min < fp{y) -V is a partition of S > . (21) 

Ivg-p J 

The function gp is called the Dilworth truncation of fp. 

The base polyhedron of any fully submodular function 
always exists, i.e., there exists a rate vector R such that 
R{M.) = gp{M). Since, P{gp) = P{fp), it follows that 
Bidfi) = Bifp) whenever (Ad) = fp{M) = jd, i.e., when 
B{fp) f 0 which implies feasibility of the optimization 
problem (fTOl l. 

Continuing with Example |2] the Dilworth truncation of the 
set function /4 is given by 

P4({1}) = 0, 54({2}) = 2, g4({3}) = 2, 

P4({1,2}) = 2, P4({1,3}) = 2, 54({2,3}) = 3, 
54({1,2,3}) = 3. (22) 

Note that /4({1, 2, 3}) f 54({1, 2, 3}), and hence, /3 = 4 is 
not a feasible sum-rate for the problem (fT^ . On the other 
hand, for /3 = 5, Dilworth truncation of a set function f^ is 
given by 

ff5({l}) = l, ff5({2}) = 3, g5({3}) = 3, 

55({1,2}) = 4, P5({1,3}) = 4, P5({2,3}) = 4, 
55({1,2,3}) = 5. (23) 

Now, /sdl, 2, 3} = psdl, 2,3}) =/? = 5 which indicates 
that /3 = 5 is a feasible sum-rate for the problem ( fT9] l. Hence, 
the optimization problem ( fT9] l can be written as 

m 

min p^{Ri), s.t., I{.€B{gp) (24) 

i=l 

provided that gp{A4) = (d. 

Remark 2. Parameter /3 is feasible w.r.t. the problem (fT9l l 
if gp{A4) = jd. Otherwise, gp(A4) < fd. This is the direct 
consequence of the Dilworth truncation (1211 1. 

Depending upon the cost function T{B,i), in the 

sequel, we provide several algorithms that can efficiently 


solve problem (|9]l. First, we analyze a special case when 
the cost function is linear, 

(pi{Ri) = aiRi, ai>0, \/i £ M. (25) 

The condition ai > 0, i € Ai ensures that pi is a non¬ 
decreasing function. 

B. Linear Cost - Edmonds ’ Algorithm 

When the cost function is linear, the optimization prob¬ 
lem (l24l l has the following form 

m 

miri_ ajRi, s.t., R e i3(p^). (26) 

i=l 

Due to the submodularity of function gp, the optimization 
problem (|2^ can be solved analytically using Edmonds’ 
greedy algorithm ll22l (see Algorithm [T]i. 


Algorithm 1 Edmonds’ Algorithm 

1: 

Set j{l),j{2),...,j{m) to be 

an 

ordering of 


{1,2,..., to} such that < 0 :^( 2 ) < • • 

^ ^j{7n) 

2: 

Initialize R* = 0. 



3: 

for i = 1 to TO do 



4: 

%) =5/3({j(1),j(2),... 





• ■ 

-I)})- 

5: 

end for 




The greediness of this algorithm is reflected in the fact that 
each update of the rate vector is sum-rate optimal: 

^4(1)+’R4(2) =5/5({j(1),J'(2)}) 


i=l 

In other words, at each iteration, the individual user’s rate up¬ 
date reaches the boundary of polyhedron P{gp). Optimality 
of this approach is the direct consequence of submodularity 
of function gp fT2\ . 

Remark 3. The optimal rate vector R* belongs to the base 
polyhedron B{gp). In other words, 

m 

Y,R*=gp{A4). (28) 

i=l 

Remark 4. The complexity of Edmonds’ algorithm is 
0 {m ■ d), where A is the complexity of computing function 
gp{S) for any given set 5 Q Ai. 

Example 2. Let us consider the same source model as in 
Example |2] and let the cost function be Ri -f 3i?2 + 2i?3, 
and /d = 5. The intersecting submodular function fp, and its 
Dilworth truncation gp are given in (fTSl l and (|2^ . respec¬ 
tively. The rate vector is updated in an increasing order w.r.t. 






the weight vector. In this case, the order is 1 —3 —2 (see 
Figure |2]l. 


Ri 



Fig. 2. Edmonds’ algorithm applied to the three-user problem described 
in Example [2] with the cost function iJi -I- 3i?2 + 2 R 3 . To minimize the 
cost, the order in which we greedily update communication rates should be 
increasing w.r.t. the weight vector, i.e., 1 —> 3 —> 2. The optimal DE-rate 
vector is R’^ = 1, ijj = 1, R 3 = 3. 

The main problem in executing Edmonds’ algorithm effi¬ 
ciently is that the function gp is not available analytically. To 
compute this function for any given set <S C Ad we need to 
solve minimization problem (I 2 TI 1 . Such minimization has to 
be performed over all partitions of the set S, which annuls 
the efficiency of the proposed method. 

To overcome this problem note that we have access to the 
function fp (see (flSl l). and by Theorem |2] we know that 
P{gp) = P{fp). As pointed out before, each rate update 
reaches the boundary of polyhedron P{gp) (see dZTl i). Since 
we don’t explicitly have function gp, this polyhedron bound¬ 
ary can be calculated by applying the Dilworth truncation 
formula (ISTT l. For the three-user problem in Example |2 this 
procedure would go as follows: 

Rl = h{{l}) = l, 

Rl = min{/5({l, 3}) - /5({3})} = 3, 

Rl = min{/5({l, 2,3}) - - R^ h{{l, 2}) - Rl, 

/5({2,3})-i?3*,/5({2})} = l. 

Generalization of this procedure to an arbitrary number 
of users is shown in Algorithm |2] We refer the interested 
reader to references im and where this algorithm 
is explained in more details for an arbitrary intersecting 
submodular functions. 

In each iteration i, the minimization problem ( l29b is over 
all subsets of , j{i)}. Using the fact that all the 

subsets considered in (|29] | contain a common element j{i) it 
is easy to see that fp{S) — R*{S) is fully submodular over the 
domain set {j(l), j(2), . .. ,j{i — 1)}. Now the polynomial 
time solution of Algorithm |2] follows from the fact that 
minimization of a fully submodular function can be done 
in polynomial time ll24l . 


Algorithm 2 Modified Edmonds’ Algorithm 

1 : 

Set j{l),j{2),...,j{m) to be 

an 

ordering of 


{ 1 , 2 ,..., to} such that ocjpY) < Oij{ 2 ) < ’' 

• < ^j(m) 

2 : 

Initialize R* = 0. 



3: 

for i = 1 to TO do 



4: 

= mJn{//3(<S U {j(*)}) - 

R*{S) 



5C{j'(l),j(2),. 


-1)}} (29) 

5: 

end for 




Remark 5. The complexity of Algorithm |2] is 
0{m ■ SFM{m)), where SFM{m) is the complexity 
of minimizing submodular function. The best known 
algorithm to our knowledge is proposed by Orlin in ll24ll . 
and has complexity 0 {m^ • 7 + m®), where 7 is complexity 
of computing the submodular function. For the submodular 
function defined in (l29l l. 7 equals to the complexity of 
computing rank, and it is a function of the file size N. When 
users observe linear combinations of the file packets, the 
rank over can be computed by Gaussian elimination in 
0{N^) time. For the “raw” packet model, rank computation 
reduces to counting distinct packets, and therefore its 
complexity is 0{N). 

Remark 6. From Remark |2] and the fact that Edmonds’ 
algorithm provides a rate vector with sum-rate gp{M), it 
immediately follows that if Algorithm |2] outputs a rate vector 
R*’ such that R*{M) < (3, then B{fp) = 0, and such /3 is 
not a feasible sum-rate w.r.t. the problem (fT9l l. Hence, for 
any given /3, the feasibility of such sum-rate can be verified 
in 0{rn ■ SFM{m)) time. 

C. Finding the optimal value of /? 

So far we have shown how to compute function h{l3) 
defined in (fTSl i for any /3 when ipi{Ri) = aiRi. To complete 
our solution, i.e., to solve the problem defined in @, it 
remains to show how to minimize function h{/3) efficiently. 

Theorem 3. Function h(l3), defined in (IIOi l, is convex when 
jd is a feasible sum-rate w.r.t. the optimization problem (Hoji. 

Proof of Theorem [3] is provided in Appendix |B] 

In order to minimize function h, first, we identify the set 
of sum-rates (3 that are feasible w.r.t. the problem (|9]l. More 
precisely, we need to find the minimum sum-rate, since every 
/3 that is larger than or equal to such value is feasible as well. 
Hence, we proceed by analyzing the sum-rate objective, i.e., 
when ipi{Ri) = Ri. 

For any fixed parameter (3 G Algorithm |2] provides 
an optimal rate allocation w.r.t. the linear cost. It is only left 
to find /3 that minimizes h{f3) in (0. Let us first consider 
the sum-rate cost, i.e., pi{Ri) = Ri. From the equivalence 
of the Algorithms [1] and |2] and from Remark [3] it follows 
that for any given parameter jd, the output rate vector R*’ of 














Algorithm |2] satisfies 


Y,Rt=9p{M). (30) 

i=l 

Thus, for a randomly chosen parameter /3 we can verify 
whether it is feasible w.r.t. the problem (fTZt by applying 
Remark |2] i.e., ifYZiRl = P, then such sum-rate can be 
achieved. Therefore, we can apply a simple binary search 
algorithm to find the minimum sum-rate. Note that the 
minimum sum-rate is always less than or equal to the file 
size N. Hence, we can confine our search accordingly (see 
Algorithm [3ll. 


Algorithm 3 Minimum Sum-Rate Algorithm (binary search) 
1 : Initialize Pstart = 0, Pend = N. 

2: while Pend “ Pstart > 1 dO 

3 : P = \ 9Btart+l3erd -^_ 

4: Execute Algorithm |2] with parameter p. 

5: if 1 Rt = then 

Pend — P' 

7: else Pstart — P- 

8 : end while 

9: Pend is the minimum sum-rate. 


Remark 7. The complexity of Algorithm |3] is 

0{m ■ SFM{m) ■ log A^). 

For the general linear cost function (pi{Ri) = aiRi, by 
Theorem [3 h{P) is convex for P greater than the minimum 
sum-rate (obtained from Algorithm O. In Section im:i 
Lemma|5] we show that the search space for P that minimizes 
function h can be limited to the file size N. Hence, in order 
to solve the minimization problem (O we can apply a simple 
binary search algorithm that finds the minimum of h{P) by 
looking for a slope change in function h. 


Algorithm 4 Minimum Linear Cost Algorithm 
1 : Initialize pstart = Ptum^ Pend = N, where is the 

minimum sum-rate obtained from Algorithm [3] 

^start^end ~| 

2 !• 

3: Execute Algorithm |2] for P — P, and P + \. 

4: if h{P) < h{P — 1) and h{P) < h{P + 1), then 
5: R* that corresponds to the sum-rate P is an optimal 

rate allocation 

6 : else if h{P — 1) > h{P) > h{P -\- 1), then 

7 : Pstart = /? + !. 

8 : else Pend = P - 1. 

9: Go to Step 2. 


Remark 8. Since for any fixed P, h{P) can be found 
by using Algorithm |2] and P*^^ can be found by ap¬ 
plying Algorithm [3l the complexity of Algorithm |4] is 
0{m ■ SFM{m) ■ log A^). 


D. Using Subgradient Methods to Solve Step 4 of Algo¬ 
rithm \2\ 

In this section we propose an alternative solution to the 
minimization problem (|2^ in Algorithm [2] that does not in¬ 
volve minimization of a submodular function. The underlying 
linear optimization problem has the following form 

m 

min s.t. R £ (31) 

i=l 

given that /3 is a feasible sum-rate. Without loss of generality, 
let us assume that ai < q ;2 < • • • < am- In this case, the 
minimization in Step 3 of Algorithm |2] can be written as 

R* = min{// 3 (<S) — R*{S) : i € S, 

s 

5C{l,2,...,i}}, (32) 

Minimization (l32l l can be interpreted as a maximal update 
along the coordinate such that R* still belongs to poly¬ 
hedron P{fp). This problem can be separately formulated as 
the following minimization problem 

R* = maxi?i, (33) 

ReR* 

s.t. Rfe > Rl, fc = 1,2,..., i - 1, 

R{S U {z}) < fp{S U {*}), V5 C {1, 2,..., i - 1}. 

Note that in an optimal solution, the condition Rk > R^, 
k = 1 ,... — 1 , holds with equality because any possible 

increase of Rk can lead to the smaller value of Ri. Moreover, 
since the above minimization is over an integer submodular 
polyhedron, the optimal solution is also an integer number. 
Therefore, minimization problems (l3^ and (l32l i are equiva¬ 
lent. 

Let us denote by the rate region that corresponds to 
the optimization problem (l33l l. 

= {R e R* I R{S U {i}) < fp{S U {i}), 

V5C{l,2,...,i-l}}. (34) 

To solve optimization problem (l33t . we apply the dual 
subgradient method. First, the Lagrangian function of the 
problem (l3^ is 

z-l 

C{R,X) = R, + Y,^k{Rk-R*k), RG7e«, (35) 

k=l 

where Xk > 0, k = 1,2,... ,i — 1. Then, the dual function 
(5(A) equals to 

(5(A) = max E(R, A) 

Re7?.(*) 

{ i-i 'j i-i 

Ri + V'A/cR/c I(36) 

Due to the maximization step in (|36]l over multiple hyper¬ 
planes, it immediately follows that 6 {\) is a convex function. 
By the weak duality theorem ||25l, 

(5(A) > R*, VAfe > 0, k=l,2,...,i-l. (37) 















Hence, 

mm {6{X) I Afc > 0, fc = 1, 2,..., i - 1} > i?* (38) 

Since optimization problem (l3^ is linear, there is no duality 
gap, i.e., 

R* = mm{5(A) | Afe > 0, k = 1,2,... ,i - 1} . (39) 

To solve optimization problem (|39] |. we apply the dual 
subgradient method 12^ as follows. Starting with a feasible 
iterate Afc[0], k = 1,2,...,* — 1, w.r.t. the optimization 
problem (|39] |. and the step size 9j, every subsequent iterate 
Afc[j + 1] for all k = 1,2 ,...,* — 1, can be recursively 
computed as follows 

Afc[j + 1] = {Afc [j] -9,iRk[j]-Rl)}^, (40) 

where Rk[j] is an optimal solution to the problem 

max Ri+y^ Xk[j]Rk- (41) 

Note that Rk[j], k = 1, 2, ...,* — 1, is a derivative of the 
dual function 5(A[j]). 

Lemma 2. An optimal solution to the problem (|4T]i can be 
obtained as follows. Let f(l), t{2),..., be an ordering 

o/ 1, 2, ...,*- 1 such that At(i) > \t( 2 ) > ■ > Xt{i-i). 

Then, 

10 , otherwise. 

k-l 

Rt(k) = fp{St{k) U {*}) - ^ Rt(u)[j] - Ri[j], (42) 

u—1 

for k = 1 , 2 ,..., i-l, where St(k} = 1^(1), ^(2), ■ ■ ■ 

Proof of this Lemma is provided in Appendix |D] 

Remark 9. The complexity of the algorithm proposed by 
Lemma |2] is 0{i log * + * ■ N^). 

The reason why we apply subgradient methods instead of 
a gradient descent is because function 5(A) [j] even though 
convex, is not differentiable. From Lemma |2] it follows that 
for a given A[j], there may be more than one maximizer of 
the problem (HTt . Due to possibility of having more than one 
direction along which we can update vector A[j] according 
to (l40l i. subgradient method is not technically a descent 
method; the function value 5(A[j]) may often increase in 
the consecutive steps. For that reason, at each step we keep 
track of the smallest solution up to that point in time 

A[i] =argmin{5(A[0]),5(A[l]),...,5(A[j])}. (43) 

Before we go any further, note that the primal optimization 
problem (l3^ is over real vectors. However, the minimiza¬ 
tion (l3n i is an integer optimization problem. As pointed out 
above, the optimal solution of the problem (l33l l is equal to 


the solution of the problem (l3Tl l. Therefore, we can choose 
the number of iterations I of the dual subgradient method 
such that we get “close enough” to an integer solution. In 
other words. 


6{\[l]) - R* 


< £, 


(44) 


where s < 0.5. Then, 


i?* = round(5(A[(])). (45) 


Convergence Analysis 

In this section we explore the relationship between the 
number of iterations of the dual subgradient method I, and 
the step size 9j, such that it is guaranteed that (|45] | provides 
the optimal solution. 


Lemma 3. Let A* be an optimal vector that minimizes the 
dual function 5. Then, 


(eLi v[oi)“ + (ei'A A;)L2Jv*E‘;i«| 

Proof of Lemma [3] can be derived from the notes on 
subgradient methods presented in ll26l . For the sake of 
completeness, we provide its entire proof in Appendix |E] 
Since by Lemma |2 A* can be an arbitrary minimizer of 
the dual function 5, let us choose A* that can be bounded as 
suggested by the following lemma. 


Lemma 4. There exists an optimal solution to the prob¬ 
lem (I391 l that satisfies 


i-l 

^ A{ < m. (47) 

fe=i 

Proof of this Lemma is provided in Appendix |F] Initial 
feasible A[0] can be chosen as follows 


Afe[0] = 0, Vke{l,2,...,i-1}. (48) 


nbining (l46l l. (l47l l and (|49] |. we obtain 

„2 I 0Ar2v^*-l 


W-1])-5(A*) 


< 


m' 




(49) 


There are many ways to choose the step size that satisfies 
the condition (gUl. Here, we briefly examine the constant 
step size, where 9j = 9, j = 0,1,2,.... For other choices on 
selecting an appropriate step size 9j, we refer the interested 
reader to the notes on subgradient methods. When 9j = 9, 
the inequality (l49l l becomes 


5(A[/-1])-5(A*) < 


rri^ + 2NH9^ 

m 


Hence, the condition (l44l i is satisfied when 


(50) 


*7*2 -p 2NH9'^ ^ 1 

m 2 ' 


( 51 ) 








It can be easily verified that (fSTT ) holds when 

1 


< 


I > 




m 


(52) 

(53) 


0(1 - 2N^9)' 

Putting all these results together, the minimization (l32l l can 
be obtained by running Algorithm |5] 


Algorithm 5 Minimization (|29] | of Algorithm |2] 

1 : Select parameters I, and 9j, j = 0,1,..., Z — 1 such that 


2 

3 

4 


^ 2 - 

Set Afe[0] = 0, /c = l,2,...,i- 1 , and A[0] = A[0]. 
for j = 0 to Z — 1 do 


(54) 


Afe[j + 1] = {Afc[i] - 9,iRk[j] - , 


for k = l,2,...,i — 1, where R[j] is computed 
according to Lemma |2] 

5: 

X[j + 1] = argmin |5(A[j + 1]), 5(A[j])| . 

6 : end for 

7: 


R* = round ^5(A[Z])^ . 


Remark 10. From Remark |9] it follows that the complex¬ 
ity of Algorithm |5] is SFM{m) — O{lm\ogm + Im'y). 
For a constant step size 9, from (l52T i and (l53]) it follows 
that the complexity of Algorithm |5] can be bounded by 

log m + 

E. General Convex Separable Cost 

In the previous section, for the linear cost function, we 
applied Edmonds’ algorithm in order to obtain the optimal 
rate allocation. Edmonds’ algorithm is greedy by its nature 
since all rate updates are reaching the boundary of polyhe¬ 
dron P{gp). This effectively means that Edmonds’ algorithm 
provides rate allocations that are vertices of the base polyhe¬ 
dron B{gp). While this was an optimal approach in the case 
of linear objectives, for the general convex separable cost 
function the optimal rate vector may not belong to a vertex 
of B{gp). We will show this in Example [3] 

The general convex cost optimization problem 

m 

'Pi{Ri), S.t. ReS(p^) (55) 

i=l 

is known as a resource allocation problem under submod- 
ular constraints ED, and it can be solved by applying 
the following intuitive approach: instead of applying greedy 


scheme, we will incrementally update by one symbol in Fg 
a communication rate of a user that has the minimal discrete 
derivative (see Algorithm | 6 ]l. 


Algorithm 6 Minimizing convex separable cost under sub- 
modular constraints_ 

1 : Set Ri = 0, \/i £ M 
2 : for j = 1 to /3 do 
3 : Find i* £ M such that 

i* = argmin -f 1) | R-I- e(i) G P[gp)} , 

i&M 

where di{Ri + l) = ipi{Ri+l)—(pi{Ri), and e(*) is the 
unit basis m-dimensional vector with coordinate 
equals to 1 . 

4: Set Ri* = Ri* + 1. 

5: end for 

6 : R* = R is an optimal rate vector w.r.t. the problem (fSSTl. 


Definition 4. Let us define set to be the set of all users 
that are in iteration j of Algorithm | 6 ] allowed to update their 
transmission rates. 

Tj={i\R + e{^£P{gf3)}. (56) 

The question is how to efficiently recover set 7j in each 
round of Algorithm| 6 ] First, we observe that P{gp) = P{fp) 
according to Theorem ID Second, note that in Algorithm |2] 
the minimization ( |29] | outputs the maximum rate vector 
update along one coordinate. Therefore, we only need to 
verify whether such update is at least equal to one symbol 
in Fg. In other words, i G 7j if 

cr-’^iw u {i}) - R{S U {i})} > 1. (57) 

5CA1\{2} 

Putting these results together, we can obtain a polynomial 
time solution to problem (fT^ by applying Algorithm [T] 


Algorithm 7 Minimizing convex separable cost under inter¬ 
secting submodular constraints 
1 : Set Ri = 0, \/i £ M 
2 : for j = 1 to /3 do 
3: Construct set 7j as follows 

" i* ■ ^ ^ 

L SCA4\{i} J 

(58) 

4: Find i* £Tj such that 

i* = argmin{di(i?i + 1 )}. 

i&Tj 


5: Set i?i. = i?i. -I- 1. 

6 : end for 

7; R* = R is an optimal rate vector w.r.t. the problem (l55ll. 



















The complexity of (|57] | is SFM{m), since the function 
fp^S) — R*{S) is fully submodular. This check can be 
done either by minimizing submodular function as suggested 
in (l57l i or by running the dual subgradient algorithm similar 
to the one proposed in Section ITll-PI Here, we briefly explain 
the differences. First, rate region defined in (l34l) . now 
has the following form 

= {R e R"* I R{S U {i}) < fp{S U {i}), (59) 

VSCM\ {z}}. 

Let us denote by R* G R"* the current rate allocation in 
round j of Algorithm |7] Then, if the maximization 

max Ri, (60) 

Re-Rt*) 

s.t. Rk> R-kJ fc = 1, 2 ,..., m, (61) 

is at least 1, then i G Tj. Problem (l60l l can be solved by 
following the same steps in solving the dual problem as in 
Section Illl-DI 

Remark 11. At each iteration. Algorithm 0 calls ( l58l l m 
times, and there are total of /3 iterations. Therefore, the 
complexity of Algorithm [T] is 0{m ■ j3 ■ SFM{m)). 

Lemma 5. Let us denote by j5* the minimizer of the function 
h defined in (1121) . Then, /3* < N. 

Proof of Lemma |5] is provided in Appendix |G] 

For the general non-decreasing set of functions ipi, i G 
j\4, from Theorem [3 we know that function h is convex. 
Moreover, by Lemma |5] it follows that the minimizer of h is 
at most equal to N. Therefore, in order to minimize h, we 
can apply Algorithm |4] which computes /i(/3) for any (3 by 
applying Algorithm |6] Thus, the overall complexity of the 
proposed solution is 0{m ■ SFM{m) ■ A^log A^). 

F. Fairness under the fixed sum-rate budget 

In this section we study the problem where for the fixed 
feasible sum-rate budget /3, the goal is to distribute communi¬ 
cation load to users as evenly as possible. Linear cost function 
is by its nature “unfair,” since it can potentially result in a 
communication scheme where only a small group of users 
transmit packets. For the fixed sum-rate budget, the “fairness” 
can be achieved by introducing an uniform, non-decreasing 
(in the integer domain) objective (pi{Ri) — Ri log Ri, i = 
1 ,... ,m, and it is illustrated in the example below. 

Example 3. Consider the same three-user problem as in 
Example |2l 

Xl = [ W 2 ]^ , 

X2 = [ Ui2 W4 W5 W6 ] ^ , 

X3 = [ W3 W4 W5 W6 ] ^ , 

where Wi G Fg, z = 1 ,..., 6. 

In case of a linear objective 2Ri + i ?2 + 3i?3, for a given 
sum-rate /? = 5, we showed hat the optimal DE-rate vector, 



Fig. 3. Algorithm [^applied to the three-user problem from Example[3] with 
the cost function Ri log Ri and the fixed sum-rate Ri H- R 2 -h R 3 = 

5. To minimize the cost, in each iteration we update the rate of the user 
who has transmitted the least amount of symbols in Fq such that the update 
still belongs to polyhedron P{f^). 

obtained by using Algorithm [S] belongs to a vertex of the 
base polyhedron B{fp): 

Rl = l, i?^=3, Rl = l. (62) 

Let us now analyze the case when the objective is ipi{Ri) = 
Ri log Ri, i = 1, 2,3. Following the notation of Algorithm |6] 
we have that 

d^{Ri -f 1) = {Ri + 1) log(i?i -Gl) - R^ logRj. (63) 

It is not hard to show that the above function di{-) is 
increasing. Hence, the minimization step in Algorithm [T] can 
be written as 

i* = argmini?i, (64) 

idTi 

where 7j can be computed from (fSTl l. and j = 1,... ,f3 is 
an iteration of Algorithm |7] The condition dMl) proves that 
Ti{Ri) = logi?i is a good measure for fairness, since it 
is enforcing the transmission vector R to be as uniform as 
possible. The execution steps of Algorithm |6] are shown in 
Figure |6] It can be verified that 

Ti = {1,2,3}, T2 = T3 = T4 = % = {2,3}. 

Therefore, the optimal DE-rate vector for this example is 
RI = fR* = 2, R* = 2. 

IV. Code Construction 

In Theorem [T] we showed that in order to achieve optimal 
communication rates, it is sufficient for each user to transmit 
the optimal number of linear combinations of its observa¬ 
tions. In this section, we show how to efficiently design the 
transmission scheme. We explain the code construction on 
the three user problem from Example |2] where 

Xi = [ W 2 ], 

X2 = [ W2 W4 W5 W6 ] , 

X3 = [ ^3 W4 W5 We ] . 


( 65 ) 














For the objective function mini?i + i ?2 + R 3 , we showed 
that the optimal DE-mte vector is i?* = 1, i ?2 = 1, and 
i ?3 = 3. This means that in an optimal scheme users 1, 
2 and 3 transmit 1, 1, and 3 linear combinations of their 
own observations in F^, respectively. We design the coding 
scheme by first constructing the corresponding multicast 
network (see Figure |4|i. In this construction, notice that there 
are several types of nodes. First, there is a super node 
S that has all the packets. Each user in the system is a 
transmitter, while in addition, each user is also a receiver. To 
model this, we denote Si, S 2 S 3 to be the “transmitting” 
nodes, and ri, r 2 and r 3 to be the “receiving” nodes. The 
side-information observed by users 1, 2 and 3 gets directly 
routed from si, S 2 and S 3 to the receivers ri, r 2 and 
through direct edges (dashed edges in Figure |4|i. To model 
the broadcast nature of each transmission, we introduce the 
“dummy” nodes ti, <2 and ts, such that the capacity of the 
links is the same as link capacity {ti,rj), j ^ i, and 

is equal to R*, \/i G A4. 


model and the communication model can be represented by 
multicast acyclic graph. Therefore, Jaggi et al. algorithm 
would find actual transmissions of each user in polynomial 
time. 



Fig. 5. When each user observes subset of the file packets, we can model 
the observations by adding an extra layer of = 6 nodes to the graph in 
Figure |4] Each extra node represents one file packet, and all extra edges 
are of capacity 1. Then, users’ observations can be modeled by connecting 
nodes from this layer to the users’ nodes si, S 2 and S 3 according to jS). 



Fig. 4. Multicast network constructed from the source model and the sum- 
rate optimal DE-mte vector R* = 1, Rj = 1, R 3 = 2. Flence, in an optimal 
scheme users 1, 2 and 3 are transmitting 1, 1, and 3 linear combinations 
of their own observations in ¥q, respectively. Each user receives side- 
information from “itself” (through the links Si —>■ ri, i = 1,2,3) and 
from the other users (through the links ti —t rj, i,j G {1, 2, 3}, i 7 ^ j). 


Now, when we have a well-defined network it is only left 
to figure out transmissions on all the edges. For instance, 
this can be achieved using Jaggi et al. algorithm ll28l . The 
first step of this algorithm is to determine N = 6 disjoint 
paths from the super-node S to each receiver ri, r 2 and rs 
by using the Ford-Fulkerson algorithm ll2^ . Such paths are 
designed to carry linearly independent messages from the 
super node to the receivers. When each user observes some 
subset of the file packets (as it is the case in this example), we 
can directly apply Jaggi et al. algorithm to this problem by 
slightly modifying the upper portion of the multicast network 
from Figure |4] (see Figure |5]l. Note that in this case, we 
were able to model observations of each user simply by 
adding one more layer of nodes which represent individual 
file packets, and then connecting these packet nodes with 
each user according to (l65T l. In other words, the entire source 


In the case of general linear packet model, it is not possible 
to represent users’ observations just by adding one extra 
layer of nodes to the multicast graph as in Figure 0 This 
is because there is an underlying correlation between all 
the linear combinations that appear in the users’ observation 
vectors, and it would be suboptimal to treat all these com¬ 
binations independently. For that reason, it is more suitable 
to apply Harvey’s algorithm ll20l which is based on matrix 
representation of transmissions in the network lfT9l , ll30l . and 
simultaneous matrix completion problem over finite fields. 
In the remainder of this section, we briefly examine building 
blocks of this code construction algorithm. 

First, we choose source matrix A to be the side- 
information matrix of all users as, 

A=[Af ... Al 0...0], ( 66 ) 

where A^ corresponds to the observation matrix defined in 
dD- Matrix A is an A x £ matrix, where i is the total number 
of edges in the network. 

The transfer matrix M(ri) from the super-node S to any 
receiver r^, i G A can be obtained as shown in lfT9l . It is a 
N X N matrix with the input vector w, and the output vector 
corresponding to the observations at the receiver 

M(rO = A(I-r)-iD(rO, i = l,2,...,m, (67) 


where F is adjacency matrix of the multicast network, and 
D(ri) is an output matrix. For more details on how these 
matrices are constructed, we refer the interested reader to 
the reference im. 

A multicast problem has a network coding solution if and 
only if each matrix M(ri) is non-singular. In 1201 . the author 
showed that for the expanded transfer matrix defined as 


E(r.) 


A 0 

I-r D(r,) J ’ 


i = 1, 2,..., m, (68) 


it holds that det(M(ri)) = ±det(E(ri)). 











Some entries of the matrices F and D(ri), i = 1,2,... ,m, 
are unknowns. To obtain the actual transmissions on all 
the edges, it is necessary to replace those unknown entries 
with elements from such that all matrices E(ri), i = 
1,2,... ,m, have full rank. This is known as a simultaneous 
matrix completion problem and it is solved in ll 20 l in 
polynomial time provided that |Fq| > m. 

Remark 12. The complexity of the algorithm proposed 
in II 20 II applied to our problem is 0{m‘^ ■ 7 • log(TO • N)), 
where 7 is the complexity of computing rank. 

V. Randomized Algorithm 

In this section we combine Algorithm | 6 ] with the linear 
network coding scheme to produce a randomized solution 
to the optimization problem (IT 2 l) of linear complexity (in 
number of users). First, note that Algorithm | 6 ] is incremental 
by its nature, i.e., in each iteration we update the rate vector 
by one symbol in F^. Say that user i updates its rate at 
round j of Algorithm| 6 ] Along with the rate update, let user i 
transmit an appropriately chosen linear combination of its 
observations; using the notation from Section [III we have 

= bp’ • A, • w, (69) 

where bp’ G Fp, is the vector of coefficients that lead 

I y 

to the optimal communication scheme. We note that those 
coefficients are not known a priori; they can be hgured out 
by applying the algorithm proposed in Section |IV] only after 
the entire optimal DE-rate vector is recovered. For now, let 
us just assume that we have access to the vectors b^ for 
all iterations j = 1,..., /3, and for all users i € A4 that 
are scheduled to update their communication rates. Later, we 
will use random linear network coding argument to relax 
these assumptions. 

In the expression ( |69] |, let us dehne G F^ as 

= bp’• A,. (70) 

Then, we can write (l69l l as 

wp’ = • w. (71) 

By generating transmissions along with the rate updates, we 
can actually reduce the complexity of verifying whether the 
rate vector update still belongs to the polyhedron P{fp). This 
result is stated in the following theorem. 

Theorem 4. Let the set 'Tj be defined as in (156b . Then, 

Tj = {i € M I rank ^A^ U U • • • U 

>N-{P-j + l)}. (72) 

Proof of Theorem |4] is provided in Appendix |H] 

So far we have assumed that the vectors are provided 
to us deterministically, and that they render optimal commu¬ 
nication scheme. However, this assumption is unjustihable 
since we saw in Section |IV] that in order to construct a de¬ 
terministic communication scheme we need to know optimal 


DE-xate. vector beforehand. To go around this problem we 
invoke a random linear network coding scheme. The basic 
idea behind the random linear network coding argument is 
that if user i is scheduled to transmit in round j, then we 
can choose vectors bp’ in ( |69] | uniformly at random over 
Fp. The following lemma provides a relationship between 
probability of generating optimal transmissions and the held 
size q. 

Lemma 6 . For the random linear network coding scheme, 
the probability of choosing an optimal sequence of vectors 
j = 1,2 ,..., P, is at least (1 — . 

The proof of Lemma | 6 ] directly follows from Il30l . The 
idea is to relate this problem to a multicast problem as in 
Section IIVI while assuming that the optimal rates are given. 
Then, by randomly generating transmissions on each link, we 
obtain the exactly same formulation as in lf30l . 

Putting all these results together, from Algorithm | 6 ] we 
can devise its Randomized counterpart as follows (see Algo¬ 
rithmic). 

Algorithm 8 Randomized Algorithm 

I: Set = 0, Vi G Af 

2 : for j = 1 to /3 do 

3: Determine 7j as dehned in (l72l i. 

4: Find i* G 75 such that 

i* = axgxnm{di{Ri -f 1) | i G Tj} . 

ii) 

5: Let i* transmit, and create a transmission v}i by 

creating a vector bp’ uniformly at random over Fp*. 

6 : Set Ri* = Ri* + 1. 

1 : end for 

8 : R* = R is an optimal rate vector w.r.t. the problem (l55ll. 


Remark 13. The complexity of Algorithm[^is 0{m ■ y ■ N), 
where 7 is the complexity of computing rank. 

Remark 14. When /3 is not a feasible sum-rate w.r.t. the opti¬ 
mization problem (fTSl i. then after /3 iterations of Algorithm^ 
there exists a user that cannot reconstruct all the packets. In 
other words 

3i G Af, s.t. rank ^A^ U U • • • U < N. 

In order to solve the optimization problem (O, we can 
apply a binary search algorithm similar to Algorithm |4] 
Thus, the overall complexity of the proposed algorithm is 
0{m ■ 7 • A log A). 

Example 4. Let us consider the same problem as in Exam¬ 
ple [3) 

Xi = [ W2 ], 

X2 = [ W2 W4 W5 W6 ] , 

X3 = [ W3 W4 W5 We ] , 







where Wi € Fg, i = 1,... ,6, and q is some large prime 
number. For the uniform objective a 

fixed sum-rate X]i=i — 5, Algorithm [8] executes the 
following steps: 

• Set Ri — R 2 = i ?3 = 0. 

• j = 1 : Updates of the rate vector R* are selected 
according to the rule ( l64t : 

argmin {R,\i^Ti = {1, 2,3}} = {1, 2,3}, 

User 1 transmit some random linear combination of its 
observation, say = wi + 7 w 2 - Set 

R\ = R\ + 1 = 1. 

« j = 2 : Vector R is updated according to the rule: 

argmin{Rj | i g 7 ^ = {2,3}} = {3}. 

User 3 transmit some random linear combination of its 
observation, say ' = W 3 + Wi + + llwe- Set 

-R3 = ^3 + 1 = 1- 

• J = 3 : Vector R is updated according to the rule: 

argmin{Rj | i g 7^ = {2,3}} = {2}. 

User 2 transmit some random linear combination of its 
observation, say V 2 = 4w2 + 3w4 + 13w5 + 8 we. Set 

R 2 = R 2 + 1 = 1. 

« 7=4: Vector R is updated according to the rule: 

argmin{Rj | i g 7} = {2,3}} = {3}. 

User 3 transmit some random linear combination of its 
observation, say = 9 w 3 + 5^4 + Idws + 17we. Set 

-R3 = ^3 + 1 = 2. 

• J = 5 : Vector R is updated according to the rule: 

argmin{Rj | i g 7^ = {2,3}} = {2}. 

User 2 transmit some random linear combination of its 
(5) 

observation, say V 2 = llw 2 + 2u;4 + ISws + Gwg- Set 

R 2 = R 2 + 1 = 2. 

« R* = R is an optimal DR-rate vector w.r.t. the uniform 
objective and the condition R{M) = 5. 

It can be verified that after this round of communication all 
the users are able to recover the file. 


VI. Introducing Capacity Constraints 
I n this section we explore a data exchange problem where 
the transmissions of each user can be further restricted. For 
instance, we can limit the total number of packets sent by 
each user. Say that user i is not allowed to transmit more than 
Ci packets in Fg. Then, optimization problem (|9|l becomes 

min /i(/3), (73) 

/3ez+ 

where /i(/3) can be obtained from (l24l) by adding capacity 
constraints. 

m 

h{l5) = ‘PijRz), s.t., li eB{gp), (74) 

i=l 

Ri < Ci, Vi g M, 

provided that gp{M) = /3. We also assume that the capacity 
vector c is feasible, i.e., there exists a rate vector R g B{gjj) 
such that the capacity constraints in (l74l i are satisfied. 

In Section |III] we pointed out that the optimality of all 
the algorithms we studied is guaranteed due to the fact that 
the constraint set of the problem (fT^ constitutes a base 
polyhedron of a submodular function. In this section we show 
that by adding individual capacity constraints, the constraint 
set in (fTSl i also forms a base polyhedron of a submodular 
function. This implies that in such a case we can still apply 
every algorithm developed so far in order to obtain an optimal 
DR-rate vector. 

We begin our analysis by defining the restriction of a 
submodular function (see lIZTll for the reference). 

Definition 5. For a submodular function gp : 2^ Z, and 
a vector c g Z™, define a function g'^ : 2^ ^ Z by 

g^S) ^ mm{ 5 ; 3 (V) + c(5 \ V) | V C 5} , V5 C 7W. 

(75) 

The submodular function is called the restriction of gp by 
vector c. 

Theorem 5 (Theorem 8.2.1 in IZTl l. Let gp be restriction of a 
submodular function gp by vector c. Then, g^ is submodular. 

Theorem 6. For a submodular function gp defined in (1211) 
and a feasible capacity vector c w.r.t. problem Gil), the base 
polyhedron B(gp) of the restriction of gp by c, is given by 

B{g}) = {R I R g B{gp), R, < q, Vt g Af} , (76) 

provided that the sum-rate P and the capacity vector c are 
feasible w.r.t. the optimization problem (O. 

Proof of Theorem | 6 ] is provided in Appendix |T] 

From Theorem | 6 ] it follows that the constraint set of (l74l) 
forms a submodular polyhedron R(p^), which further implies 
that all the algorithms developed so far can be applied to 
obtain an optimal DR-rate vector. For instance, with capacity 
constraints. Step 4 of Algorithm [T] becomes 

R}(,) = min{cj(i),5f/3({j(l),7(2),...,7(i)}) (77) 

-gp{{J{l),m,...,J{^-l)})]. 


This modification propagates to Algorithm |2] as well. Sim¬ 
ilarly, at iteration j, Step 4 of Algorithm [T] and Step 3 of 
Algorithm [ 8 ] is modified as follows 

i* = argmin{di(i?* + 1) | i G TJ, s.t., i?* + 1 < Ci}. 

(78) 

Remark 15. If the capacity vector c was not feasible w.r.t. 
problem ( |74] |. then all algorithms considered so far would 
terminate before reaching sum-rate equal to 13. 


Example 5. Let us consider the same problem as in Exam- 


ple| 2 ] 

Xi = [ 

Wi 

W2 

]> 



X 2 = 1 

W2 


W5 

WQ ] , 


X 3 = 

W3 

W4 

W5 

1 , 


where Wi G F^. Let the cost function be Ri + 3i?2 + 2i?3, 
the sum-rate /3 = 5, and the capacity constraints Ci < 2, 
i = 1,2,3. Then, by applying Algorithm |2] with the modifi¬ 
cation GTll, we obtain the following result. 

Rl = min{/5({l}),ci} = 1, 

Rl = min{min{/ 5 ({l, 3 }) - ,/ 5 ({ 2 })}, C 3 } = 2, 

R; = min{min{/ 5 ({l, 2,3}) - i?} - i?;, ^({l, 3}) - i?}, 
/5({2,3})-i?^,/5({2})},C2} = 2. 

Without capacity constraints, as it was the case in Example |2] 
user 3 would transmit 3 packets in F,. 


VIE Conclusion 

In this work we addressed the problem of the data ex¬ 
change, where each user has some side-information about 
the file, and is interested in recovering it. We assumed that 
the users are allowed to “talk” to each other over a noiseless 
broadcast channel. Eor the case when the side information 
is in the form of the linearly coded packets, we provided 
deterministic and randomized polynomial time algorithms for 
finding an optimal communication scheme, w.r.t. a separable 
convex communication cost, that delivers the file to all users. 


Appendix A 
Proof of Theorem[T| 


that the number of linear equations sent by the remaining 
users is 


i ?3 = rank (A{i_ 2 . 3 }) - rank (A{i_ 2 }) (80) 


Rm = rank (Aai) - rank (A^\{„}) 

= A - rank . (81) 

Observe that the number of linear equations each user 
sends depends upon the ordering of users in equations (l79l l 
through (HB. Let j{2),..., j{m) be any ordering of 
2,... ,m. Then, by applying the same approach as above, 
we obtain other feasible rate tuples. 

Rj(2) = rank (A{i ^(2)}) - rank (A{i}) (82) 

i?,( 3 ) = rank (A{i_,( 2 ),,( 3 )}) - rank (A{i^( 2 )}) (83) 

Rjim) = N - rank . (84) 

From (I82li-(l84li. observe that 

m 

RJ{^) =N- rank , t = 2,...,m. 

i—t 

By using this method of ordering, we can reconstruct any 
vertex of the region 

m 

Rj{i) >N- rank (A{i_j( 2 ),... j(t_i)}) , f = 2 ,..., m, 

i—t 

for all permutations j(2),... ,j(rn) of the set \ {!}. 

(85) 


The region in (IHST i is equivalent to 
Y, Ri> N — rank (A_^\g) , VS C M s.t. {1} ^ 5. 

i&S 

Let us denote the above region by TZi. Similarly, for users 2 
through TO, we can define regions 72.2, • ■ •, R-m ■ Let us denote 
by TZint the set of all integer vectors Z™ that belong to the 
cut-set region 72 defined in Q- Then, it is not hard to show 
that 


In order for each user in A4 to reconstruct the file, it is 
necessary for all of them to receive a sufficient number of 
linear combinations over Fg so that the observation rank of 
each user is full. For instance, in order for user 1 to recover 
all N packets of the file, it is sufficient for him to select 
N — £i linear equations from the remaining to — 1 users. In 
this case, user 2 can send to user 1 

7?2 = rank (A{i 2 }) - rank (A^ij) (79) 

of its linear equations, after which user 1 will have observa¬ 
tion rank rank(A{i 2 }). Following this procedure, we have 


Rint — Ri n R2 n • • • n Rm- (86) 

From the discussion above, we know that if R G Rint, then 
it is sufficient for user i to send Ri linear equations separately 
to all the users, which makes the total of (to — l)7?i equations 
over Fq sent by user i. The key property of the linear network 
codes is that there exists one set of Ri linear equations that 
user i can broadcast and simultaneously satisfy demands of 
all the remaining users in Ad, provided that the field size |Fq| 
is large enough ED- Hence, every rate tuple that belongs to 
Rint can be achieved via linear network coding. 


Appendix B 
Proof of Theorem[3] 

Consider two feasible sum-rates jSi and (32 w.r.t. the 
problem (fTSl i. We show that for any A € [0,1] such that 
A/ll -I- (1 — A)/32 G it holds that /i(A/3i -b (1 — A)/l 2 ) < 
Xh{Pi) -b (1 — \)h{j32)- Let and be the optimal 
rate tuples w.r.t. h{j3i) and h{P 2 ), respectively. Note that 

Xhipi) + {1 - X)h{P2) 

m 

= ^(A(^.(i?«) + (l-AV,(Rf))) 

i=l 

> ^ (^.(ARf) + (1 - A)i?f)) = ^ ^ (87) 

i=l i-l 

where (a) follows from the convexity of ipi, \/i € A4, and 
R(^) = AR^^^ + (1 ~ A)R(^). Now, we show that is 
a feasible DL-rate vector for the problem (ITSl i when (3 = 
XPi + (1 — A)/32- 

Since = /3i and R^'^'>{A4) = (32, it follows that 

rW(^M) = XR^^\M) -b (1 - A)R(2)(M) 

= A/ll + (1 - A)/32. (88) 

Since 

> iV-rank(A^\ 5 ), VS C M, i = l,2, 

we have 

i?(^)(5) = AR(i)( 5) + (1 - A)i?(2)(5) 

>N- rank(A^\5), V5 C M. (89) 

From (l88l l and ( |89] | it follows that R^^^ is a feasible Dis¬ 
rate vector w.r.t. optimization problem (IT^ when [3 = A/3i -b 
(1-A)/32- Therefore, ^ + (1 - X)P 2 )- 

Hence, from (l87l i. it follows that 

ii(A^i + (1 - X)h) < Xh{pi) + (1 - XW 2 ), (90) 

which completes the proof. 

Appendix C 
Proof of Lemma[T] 

When 5 n T 7 b 0, the following inequality holds due to 
the submodularity of the rank function 

MS) + MT) 

= rank(A5) -b rank(A7-) — 2{N — (3) 

> rank(A5ur) + rank(A5nr) - 2(A^ - (3) 

= fpisyjr)^ fp{Sr\T). (91) 

To show that the function fp is submodular when /3 > A, it 
is only left to consider the case iSflT =0. Since fp{^) = 0, 
we have 

fp{S) + fp{T) 

= rank(A5) -b rank(A7-) — 2{N — f3) 

> rank(A^ur) - {N -/3) = fpiS U T). (92) 


The inequality in (|9^ directly follows from the submodular¬ 
ity of the rank function. 

rank(A5) -b rank(A7-) — rank(A5uT) > 0 > P — N. 

This completes the proof. 

Appendix D 
Proof of Lemma|2] 

Let us construct the set function y : —7 Z as 

follows 

fO if 5 = 0, 

2/(5) = J//3 (5) if*e5, 

[/^(5U{r}) if* ^5. 

First, we show that = P{y). Let R S D(i/). Then, for 
any S C {1, 2, — 1}, it follows that 

R{S U {*}) < yiS U {*}) = fpiS U {*}). (93) 

Therefore, R € TZ^^\ 

Now, let R € T^L) From (l34l i we have 

R{S U {*}) < fpiS U {*}) = yiS U {*}), (94) 

V5C {1,2,. 

Since the rate vector is positive, (|94] | implies that 

R{S) < MS U {*}) = 2/(5), V5 C {1,2,..., * - 1}. 

(95) 

From ( |94] | and ( l95T l it follows that R G P{Y). Hence, T^L) = 
P{Y). 

Next, we show that function y is fully submodular. For any 
5, T C {1, 2,..., *}, let us consider the following 3 cases 
Case i G S, i 

y{S)+y{T) = fp{S)+fp{T^{i}) 

> /^(5ur)+/M(5nr)U{*}) 

= 2/(5 UT) -b2/(5nT), 

where (a) is due to intersecting submodularity of function 

u- 

Case 1\ i ^ S, i 

y{S) + 2 /(r) = //3(5 u {*}) + /Mr U {*}) 

>/M5 U r U {*}) +/M(5 n r) U {*}) 

= 2/(5 U r) + 2/(5 n r). 

Case 3: i G S, i G T 

2/(5) + 2/(r) = /M5) + /Mr) 

>/M5ur) + /M5nr) 

= y(5 ur)+2/(5 nr). 

Therefore, function y is indeed fully submodular. Hence, 
problem dril l is a linear optimization problem over a sub¬ 
modular polyhedron, and it can be solved by applying 
algorithm similar to Algorithm [T] (see reference E2\ ). The 


only difference is that in this case, the weights in the Step 1 
of Algorithm [T] should be ordered in a non-increasing order. 
If At(i) < 1, then 

^j] = = //3 ({*}), 

Rt{k)[j] = y{St{k) u {i}) - y{St(k-i) u {i}) 

k-1 

= fft{.St(k) u {i}) - ^ Rt(u) b1 - Ri[j], (96) 

u—1 

for A: = 1 , 2 , — 1 . 

If for some r G {1, 2,... , i — 1}, At(r) > 1 > At(r_|_i), 
then 


since Rk[j] — is a partial derivative of <5(A[j]) at coor¬ 
dinate Xk[j], k — 1 , 2 , ...,i — 1 . Summing both sides of 
inequality ( l98b over j from 0 to ^ — 1, we obtain 

i-l 

k^l 

i-l 1-1 

< ^(A,[0] - Xir -2^0, {S{\[j]) - 6{\*)) 

k—1 j—0 

1-1 i-l 2 

j—O k—1 


Ri[j] = yiSt(r) U {l}) - y{St(r)) = 0 , 

Rt{k) [j] = y{St{k) u {i}) - y{St(^k-i) u {i}) 

k-1 

= MSt^k)U{i})-'£Rtiu)[j], (97) 

u—1 

for k = 1,2,... ,i — 1. This completes the proof of this 
lemma. 

Appendix E 
Proof of Lemma[3] 

After j + 1 iterations of the subgradient algorithm, the 
Euclidian distance between A[j -|- 1] and a minimizer A* of 
the dual function S, can be bounded as follows The distance 
from 


Therefore, 

i-l 

2^0,(5(Ab])-,5(A*)) 

j=o 

i—1 1 — 1 i—1 2 

<^(Afc[O]-AJ)2 + ^0|^(i?fc[j]-i?^) . (101) 

k—1 j—0 k—1 

Since, 

i-i 

j2oAmj])-s{x*)) 

j =0 

I 

-E^J - (102) 


i-l 

+ 1 ] ~ 

k^l 
i-l 

-E 

k^l 

2 1 2 
k^l 

i-l i-l 

= E (^fcb1 - Kf - 20, E(^'^b1 - RDAkij] - K) 

k^l k^l 

k^l 

i-l 

<J 2 {Xk[j]-xif- 2 eAS{x[j])- 6 {\*)) 

k^l 

i—1 2 

+ 0|E(^'=b1-^:) , (98) 

k^l 

where the last inequality is due to convexity of function (5(A), 
i.e., 

i-l 

5(Ab]) - 6 {\*) < - RIKMj] - K), (99) 

ic=l 



from dlOlb and (|4^ we obtain 


SCX[l-l])-5{\*)= min 5(Ab]) - 5(V) 
,e{o,i,....i-i} 

^ EEi(a.[0] - xir + eEo EEi [Mj] - R^y 


eEo (srii {Rk[j]y+ eEi (Rir 

+ -ov-i-l n - 


2 e;=o^, 


SEi(Aic[0]-A*) 

2Et>. 


*12 


< 


^j =0 ^3 


-f 


eEo (srii ^47 ]) + (eEi r ^ 


2 EEo^^ 


E;=i(Aic[0]-A^)2 + 2iV2Et>? 


< 


2EE 


,=0 "J 


(103) 


where the last inequality holds because R(JXi) < f/s^Ai) < 
N for any achievable D£-rate vector R. Continuing 










with (11031 1. we have 

6CX[1-1])-S{\*) 

^ {Er=\ Afe [0])'+(srii A^)'+ 2N^ eTJo 

(104) 


Appendix F 
Proof of Lemma|4] 

For a minimizer A* of a dual function <5, let us denote by R 
an optimal solution of the problem (l36T l obtained by applying 
Lemma |2] Since Rk = // 3 ({ 1 , 2,..., i}), and from 

Algorithm [T] and Theorem^ J^k^iRl <,// 3 ({ 1 , 2,..., 
it follows that 

i-l 

Y,Rk-Rl>R*-R^■ (105) 

k^l 

By the formulation of the optimization problem (l33] t, the 
minimum value of the dual function 5 is i?*. Therefore, 

i-l 

'£KiRk-Rl) = R:-R^■ (106) 

k^l 

From Algorithm [T] and Theorem [2l it follows that 

i 

Y^R* (107) 

k^l 

= min < fpi^) ■‘P is a partition of {1,2,..., i} V 

^ IseP J 

Let us denote by S*, a set that belongs to an optimal 
partitioning V* w.r.t. problem (11071) such that i € S*. In 
this case, we have 


YRI = fpisi). 

Now, let us select A* as follows 


A 


* 

k 


1 iffcG5*, 
0 otherwise. 


(108) 


(109) 


To verify that this choice of A* is indeed a dual optimal 
solution, note that from Lemma |2l we have 

YRk = fp{S*). (110) 

kes^ 

Therefore, 

YRk-Rl = 0- ( 111 ) 

kes* 

From ( 11091 ) and (111 11 ). it follows that 


y] xuRk-Ri) 

kG{l,...i—l}\S* 

= R*-R,+ Y. XURl-Rk). 


This is consistent with (11061 ). and hence. A* is indeed a dual 
optimal solution. Therefore, 

i-l 

X]Afc<i-l<m. (113) 

k=l 


Appendix G 
Proof of Lemma[5] 

By Lemma [T] we know that set functions /jv and /n+i, 
defined in (m, are fully submodular. 


/n(5) = 


rank(A5) 

0 


if 

if 5 = 0 . 


(114) 


/n+i(5) 


1 + rank(A 5 ) if 0 7 ^ 5 C M, 
0 if 5 = 0. 


(115) 


Let us denote by R* an optimal vector obtained by applying 
Algorithm | 6 ] From the correctness of Edmonds’ algorithm, 
it directly follows that all faces of a submodular polyhedron 
P{gp) are achievable, i.e., for any SQM, there exists 
a rate vector R such that R{S) = gp{S). Comparing /jv 
and /at+i, we see that all “faces” of polyhedron P(/ 7 v+i) 
expended by 1 compared to polyhedron P{fN) (and they 
are all achievable). Hence, while applying Algorithm | 6 ] for 
/3 = A + 1, we can see that the optimal rate vector R will 
differ from R* in one coordinate. Let 


j* = argmin{di(i?- + 1) | R* + e(i) G P(/Ar+i)} . 

Then, 


Ri — 


R* + l if i=j* 
R* otherwise. 


(116) 


Evaluating costs for j3 = N and /3 = A + 1, we obtain 

m 

h{N) = = Yg’^{R*) + ^AR*). (117) 

i=l i^j 


h{N + l) = Y = Y + !)• 


Comparing (II141 ) and (II151 ). we conclude that h{N) < 
h{N + 1) since tfj is a non-decreasing function. Since h is 
a convex function (see Theorem |2), it immediately follows 
that 13* < A. 

Appendix H 
Proof of Theorem|4] 

Let us start by considering round j = 1 of Algorithm | 6 ] 
All rates are set to zero, i.e., R* = 0, j = 1,..., m. To check 
whether user i belongs to set 7 i, we need to verify whether 
its update belongs to polyhedron P{fp) 

R*{S) + l< fp{S), V5, s.t. iG5, 


( 112 ) 


(119) 












where fjs is defined in ( fTSl l. Since R* is a zero vector, we 
can write the condition (II 191 l as 

1 </3-iV + rank(As), VS C M, s.t. ieS, (120) 

which is equivalent to 

1< min {/3 — A + rank(A 5 )}. (121) 

i&SCM 

It is easy to see that S = {z} is the minimizer of the above 
problem. Hence, i G Ti if 

rank(Ai) > iV-/3, (122) 

which matches the theorem statement for j = 1. 

Say that user i belongs to 7i and that he is scheduled 
to transmit in the first round according to the cost function. 
Thus, user i transmits 

(123) 

where is appropriately chosen vector. All the remaining 
users update their observation matrix by appending vector 
to it 

AfcUu(i), VfceM\{z}. (124) 

In the next round we reduce parameter /3 by 1, and again 
ask the same question whether user i belongs to T2 for the 
updated set of observations. Combining (I122I I and (I124I I it is 
easy to see that in round j, the condition (I1221 l becomes 

rank ^A^ U U • • • U > N — {/3 — j + 1), 

(125) 

which completes the proof. 

Appendix I 
Proof of Theorem|6] 

Let R be any feasible rate vector w.r.t. the problem (l74l i. 


i.e., 

R{S)<gpiS), VS CM, (126) 

R{S)<c{S), VS CM, (127) 

R{M)=gp{M)=l3. (128) 

By substituting S with Af \ 5 in (11261 1. we obtain 

R{M \S)< gp{M \ S), VS C M, (129) 

This can be rewritten as 

R{S)>R{M)-gp{M\S) 

= P-gp{M\S), VS CM, (130) 


where the last equality comes from (I1281 l From (I1271 i. (I1281 l. 
and (II 3 OI 1 it follows that 

9 piM)-gp{M\V) <RiV) <c{V), VV C M. (131) 
From (II 3 II 1 . we have that 

gpiM) <gpiM\V) + c(y), VV C M. (132) 


From ( TTSI i and ( 1128b . we conclude that 

gl{M)=gp{M) = p. (133) 

Hence, R(M) = g^{M). Since Ri < Ci, it follows that 

R{S) = R{V) + RiS \ V) 

<gf){V)+c{S\V), VV,S s.t. VCSCM. (134) 

Finally (1134b implies that 

R{S) < min {gp{V) + c{S \ V) | V C 5} , VS C M. 

Hence, R G R(g^)- 

Conversely, let R be such that R G R(g^)- Then, 

R(S) < g^(S) < gp(S) + c(0) = g^(S), VS C M, 

(135) 

R(S) < g^^(S) < gp{%) + c{S) = c{S), VS C M, 

(136) 

R{M)=g%M)=p (137) 

where the second inequality in (1135b and (1136b directly 

follows from (l75b . From (1135b . (1136b . and (1137b it follows 

that 

'RGB{gp), s.t. Ri<Ci, ViGM. (138) 

This completes the proof. 
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